Text Rank: A Novel Concept for Extraction Based Text Summarization
نویسندگان
چکیده
Indexing used in text summarization has been an active area of current researches. Text summarization plays a crucial role in information retrieval. Snippets generated by web search engines for each query result is an application of text summarization. Existing text summarization techniques shows that the indexing is done on the basis of the words in the document and consists of an array of the posting lists. Document features like term frequency, text length are used to assign indexing weight to words. Hence indexing weights of the document words are used to calculate the sentence similarity value between document words which remains independent on context. The word based index seems to be less efficient due to information retrieval problems like polysemy and Synonymy. Thus the significance of term for building the index is reduced and the emphasis is laid on the context of the document. This paper proposes an indexing structure in which index is built on the basis of context of the document rather than on the terms basis. While doing so we have also used novel concept of Lexical association (semantic association) between document words to calculate the similarity between sentences using computed indexing Weights. The proposed concept of sentence similarity measure has been used with the graph-based ranking method to create document graph and get summary of document.
منابع مشابه
EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...
متن کاملSystematic literature review of fuzzy logic based text summarization
Information Overloadrq is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...
متن کاملBiogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization
Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...
متن کاملOdia Text Summarization using Stemmer
Lot of work has already been done for automatic text summarization. In this paper we have given a novel statistical approach to summarize the given Odia text. In our approach extraction of relevant sentences is done which can give the actual concept of the input document in a concise form. We rank each sentence in the document by assigning a weight value to each word of the sentence. The senten...
متن کاملText Summarization using Term Weights
Lot of work has already been done for automatic text summarization. In this paper we have given a novel statistical approach to summarize the given text. In our approach extraction of relevant sentences is done which can give the actual concept of the input document in a concise form. We rank each sentence in the document by assigning a weight value to each word of the sentence and a boost fact...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014